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Abstract 

In this paper, we develop a new decoding algorithm of a binary linear codes for symbol-pair read channels. Symbol- 
pair read channel has recently been introduced by Cassuto and Blaum to model channels whose write resolution is 
higher than its read resolution. The proposed decoding algorithm is based on a linear programming (LP). It is proved 
that the proposed LP decoder has the maximum-likelihood (ML) certificate property, i.e., the output of the decoder is 
guaranteed to be the ML codeword when it is integral. We also introduce the fractional pair distance dfp of a code 
which is a lower bound on the pair distance. It is proved that the proposed LP decoder will correct up to [d/p/2] — 1 
pair errors. 


I. Introduction 

Recent advances in data storage technologies make it possible to write information to storage devices with high 
resolution. However, it causes a problem of the gap between write resolution and read resolution. In the case that 
read resolution is lower than write resolution, it is difficult to successfully read the individual symbols written to 
the storage devices. The symbol pair read channel proposed by Cassuto and Blaum IT] is the channel model that 
models reading process of such storage devices. On each reading operation, two consecutive symbols are read. We 
need new error-correction framework for the new channel model. In the reading process, symbol-pair errors would 
occur rather than individual symbol errors, where a symbol-pair error means at least one of the symbol pair is read 
erroneously. The main task is to construct error correcting codes and decoding algorithms that can correct many 
symbol-pair errors as possible. 

Some error correcting codes and decoding algorithms have been studied. Cassuto and Blaum HI built a framework 
for correcting pair errors, studying a new metric called pair-distance and its relation to pair error correction capability. 
They also provided code constructions and decoding algorithms and derived some bounds on the size of optimal 
symbol-pair codes. Cassuto and Listsyn El proposed algebraic construction of cyclic symbol-pair codes and studied 
its property. Yaakobi et al. El proposed efficient decoding algorithms for the cyclic symbol-pair codes. Chee et al. 
m established a Singleton-type bound for symbol-pair codes and constructed codes that meet the Singleton-type 
bound. Hirotomo et al. fS] proposed the decoding algorithm for symbol-pair codes based on the newly defined 
parity-check matrix and syndromes. 

In this paper, we develop a new decoding algorithm of a binary linear codes for symbol-pair read channels. We 
establish a decoding algorithm which is based on a linear programming (LP). The LP based decoding algorithm 
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(LP decoder) of a binary linear codes for memoryless channels was originally proposed by Feldman et al. ||6| and 
studied further by many researchers. The LP decoder has many attractive properties as follows. 

> The output of the decoder is guaranteed to be the maximum-likelihood (ML) codeword (ML certificate property) 

M- 

• LP decoding with expander codes achieves the capacity of any binary-input memoryless symmetric LLR- 
bounded (MSB) channel I?). 

• There exist algorithms which efficiently solve the LP 

It is natural to expect that the decoder based on the LP also works well for symbol-pair read channel. Not only 
establishing the LP decoder for symbol-pair read channel, we also investigate its property. We show that the LP 
decoder has ML certificate property even when the channel is symbol-pair read channel. We also introduce the 
fractional pair distance d/p of a code which is a lower bound on the pair distance. It is proved that the proposed 
LP decoder will correct up to \dfp/2] errors. We examine the average fractional pair distance of randomly chosen 
LDPC codes from an ensemble of Gallager ifTOll . 

The rest of the paper is organized as follows. In Section 2, we introduce basic notation and definitions for 
symbol-pair read channels. In Section 3, we establish the LP decoder for symbol-pair read channels and in Section 
4, some properties of the proposed LP decoder are given. We conclude the paper in Section 5. 

11. Preliminaries 

In this section, we briefly review the model and definition of the symbol-pair read channel and error-correcting 
codes. Throughout this paper, we deal with binary linear codes and let codeword symbols E = {0,1}. Let n denotes 
the code length and C C E" denotes a binary linear code. For a length n vector x = (xq, ■ ■ ■ ,Xn-i) € E", the 
symbol-pair read vector of x, denoted by 7r(a;), is defined as 

7r(a;) = ((xo,Xi), (xi,X 2 ), ■■■ , (Xn-2,Xn-l), (Xn-l,Xo)). 

Unless stated otherwise, all indices are taken modulo n. The pair-distance between vectors a; G E” and y G E" is 
defined as 

Dp{x,y) = DH{Tt{x),'K{y)) 

= |{i : {x^,Xi+i) f , 

where Dh denotes the usual Hamming distance. Similarly, the pair weight of x is defined as 

Wp{x) = WH{Tt{x)) 

= |{z : {xi,Xi+i) f (0,0)}|, 

where Wh denotes the usual Hamming weight. We say that symbol-pair error is occurred at i-th position when at 
least one of two bits {xi,Xi+i) are changed during reading operation. 
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We assume the probabilistic model for symbol-pair read channel. Let p < | be the probability that the symbol-pair 
error occurs and let 

{ I — p if a = b „ 

a,6 e {0,1} . 

I otherwise 

Let UG CE X E)” be the received pair vector. We assume that pair errors occur independently at each position i 
and 


p{y k) =p(y k(a;)) 

n —1 

where Vi and {Tr{x))i are the ith symbol-pair of y and Tr{x), respectively. We call Dh{t^{x), V) the number of 
pair errors when the x is the sent codeword and y is the received pair vector. 
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III. LP DECODING FOR SYMBOL-PAIR READ CHANNEL 
When y is the read pair vector, ML decoding problem is described as 

minimize — lnp(y \x) 
s.t. X &C. 

This problem includes the ML decoding problem for memoryless channels and therefore it is NP-hard in general 
HD. Here, we derive the relaxed problem. 

Let be the indicator function which represents whether (7r(a;))i = (a, 6) or not, i.e., 

1 if (7r(a;))i = (a, &) 

0 otherwise 


'^i,{a,b) (^) — 


. Note that T : S" 


We also define the map T : E" -5. {E^}” as T{x) = [{n,(a,b)(^)) (a,b)&{s^) 

{E^}" is an injection and the relations Xi = Ti^(^i^o){x) -f (a;), Xi+i = Tj_(o,i)(a;) + (at) hold. 

From the assumption for the channel, — lnp(?/ \x) can be written as ~^'‘^P{yi k(®)i). Given Vi, 

piVi |(7r(a;))i) is a function of (7r(a;))i. Let Xi^(a,b) = — inp{yi |(a, b)), (a, b) G E^ then, 

-lnp(2/i |(7r(at))i) = ^ X,^(^a,b)Ti,(a,b)ix). 

(o,&)GE2 

We define A = (Aq, • • • , A„_i) A^ = {Xi^(^a,b)){a,b)GS^ > * = 0, • • • , n — 1, and let A be the map from y to A i.e., 
A = A(y). 

Summarizing above, the optimization problem (|2|l can be converted to the following optimization problem. 


minimize {X,t{x)) 

s.t. G {0,1}, t = 0,.. .,n - 1, (a,6) G E^ 

S(a,6)eE2 Ti,(a,b) (®) *=0, ...,71— 1 

Xi = Tj,(i,o)(a;) + Tj,(i4)(at), i = 0,... ,n - 1 
Xi+I = T,,(o,i)(a;)-f r,_(i_i)(a;), i = 0, ...,n-l 
X gC, 


( 3 ) 
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where t{x) = T{x) and (A,x(a;)) = K,{a,b)Ti,{a,b){^)■ Note that the constraints other than x G C 

are redundant constraints that are automatically satisfied if at S C. 

We only rewrote the problem (|2]i in another form and therefore it is still hard to solve the optimization problem. 
Then, we relax the constraints r, f,) € {0,1} to b) € [0,1] and x G C to x G Q{H), where H is the parity 
check matrix of C and Q{H) is the fundamental polytope of C Ih). Q{H) has the following properties Q 
. C c Q{H) 

. Q(iT) n{0, If =C 

• X G Q{H) can be expressed by 2n + inequalities (m is the number of rows of H and dj is the 

Hamming weight of the ith row of H) 

The relaxed problem is described as follows. 

minimize (A, t) 

s-t. e [0,1], i = 0,...,n-l,{a,b) GY.'^ 

S(o.&)GE 2 = 1, t = 0, . . . , n — 1 

i 0,...,7Z 1 

Xi+1 = f = 0,..., n — 1 

a; e Q{H). 

Note that r = (tq, • • ■ = {Ti^(^a,b))(a,b)es'^ are the optimization variables that are independently defined 

from X. If we use LDPC code for C, (|4|i is solvable in the polynomial order of n. 

The LP decoder for symbol-pair read channel consists of the following steps. 

1. Calculate A from y. 

2. Solve the LP (IDl. 

3. If the optimal solution x* is integral (a:* G {0,1}"), output x* and declare “error” otherwise. 

As in the case for the memoryless channel, the following theorem holds. 

Theorem 1: If the LP decoder outputs a codeword, it is guaranteed to be the ML codeword. 

Proof: Let (a:*, t*) be the optimal solution of dUi. Its objective value is smaller than or equal to that of other 
feasible solution {x,t). For any codeword x' G C, it is easily verified that {x',T{x')) is a feasible solution for 
(l4|i. In the following, we will confirm that no other feasible solution of the form (a;', r) exists. 

First, if x' is integral, t must be integral because assumption that x is fractional leads to the contradiction. 
For example, if 0 < (i_o) < 1 then 0 < Ti_(i i) < 1 must holds from the constraint (i_o) + 'ti,(i.i) = Xi and 
assumption that Xi is integral. Then (o,i) = 1 — (i i) from the constraint (g^) +Ti (i i) = Xi+i and assumption 
that Xi+i is integral. Then Y(a,b)&s^ A.(a,b) > ^^,( 0 , 1 ) + ^.(i.o) + A.(i.i) = 1 - T'i.(i,i) + A,( 1 , 0 ) + > 1 and 

it contradicts the constraint Y(a b)G's^ '^i,{o-,b) = 1- We can lead contradiction in other cases similarly. 

Further, there doesn’t exist integral r such that {x',t) is a feasible solution of (|4]i unless r = T{x') because 
T : E" —> is an injection. 

Therefore {x*,t*) has cost less than ore equal to feasible point that corresponds to other codeword x'. ■ 

We will show later in Lemma [T] that if {x*,t*) is a vertex of the feasible region of (IDl then x* is the vertex 
of Q(H). When the factor graph of the code is cycle-free, Q{H) has no fractional vertex [|6]. Therefore the 
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proposed optimization problem (|4| is an exact formulation of the ML decoding problem of the binary linear code 
for symbol-pair read channels in the cycle-free case. 

IV. Analysis of LP decoding for symbol-pair read channel 

Let Pr[err|a;] be the probability that the LP decoder makes an error, given that x was sent. It can be described 
as 

Pr[err|a;] = Pr {3(a;', r') s.t is a feasible solution of Q, (A, r') < {X,T{x))} . (5) 

Here, we analyze the performance of LP decoder for symbol-pair read channels. 

A. The All-Zeros Assumption 

It is common to assume that the transmitted codeword is the all-zeros vector, because the assumption makes it 
easy to analyze the performance of the decoder. Although it is not straight forward to verify that we can assume 
the assumption, we will prove that it is valid for the LP decoder for symbol-pair read channel as it is valid for 
memory less channel 16]. 

Theorem 2: The error probability of the LP decoder for the symbol-pair read channel ([T]) is independent of the 
transmitted codeword. 

Proof: See Appendix lAl ■ 

According to the theorem, we can assume that the transmitted codeword is all-zeros vector 0. 

B. Fractional Pair Distance 

Here, we introduce the concept of fractional pair distance d/p of a code C whose parity check matrix is H. 
Definition 1: The fractional parr distance d/p of a code C whose parity check matrix is H is defined as 


min WfJx) 

*eVQ\{o} 

n—1 

(6) 

®eVQ\{0} ^ 

(7) 


where Vq is the set of vertices of the fundamental polytope Q{H) and 

Wfp{{a,b)) = max {a, b} (8) 

The fractional distance of the code is defined as d/ = mina-gy^^^Q} in ||6] and the fractional pair distance is 
an modified notion of it so that it is suitable for the symbol-pair read channel. As a trivial relation, d/ < d/p 
holds from the inequality Xi < max{xi,Xi+i) = Wfp{{TT{x))i. The pair distance of a code C is defined as dp = 
miiiccecxo Wp(x) |[T]. It holds that d/p < dp from the fact that C C Vq and Wfp{{a, b)) < 1. 

The performance of the LP decoder is characterized by the fractional pair distance as stated in the following 
theorem. 

Theorem 3: For a code C with fractional pair distance d^^, the LP decoder can correct up to \d*j^/2\ — 1 pair 
errors occurred through the channel ([T]i. 
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Before proving the theorem, we prove the following lemma. 

Lemma 1: Let (x,t) be a vertex of the feasible region of (|4]l. Then a; is a vertex of the fundamental polytope 
Q(ff), i.e., X € Vq. 

Proof: See Appendix iBl ■ 

We are now ready to prove theorem |3] 

Proof: {Theorem 12) We can assume that the transmitted codeword is 0 from theorem |2] The decoding failure 
of LP decoder means that the optimal solution (at*, r*) satisfies at* f 0. From the assumption that the code has the 
fractional pair distance and Lemma[T] it holds that ^27=0 — ^*fp- Let ^ I V (0j0)| 

be the set of positions that pair errors occurred. 

To (0, r°) be a feasible solution of (|4]i, = T(0) must hold and o) “ ^ 

for i = 0, ..., n — 1. Since (at*, t*) is the optimal solution of (|4|i, it holds that 


_ _0 — T-f — D 

i,(o,i) “ L,(i,o) “ A,( 1 , 1 ) “ ^ 


(A,x*) <(A,x°). 

Subtracting the right-hand side from the left-hand side in (|2, we obtain 


(9) 


(A,T*) - (A,r°) = ^ K(a,b){.Tl(a,b) - 

i=0 (a.h)Gi:= 

^ ^ ^ ^ (L,(a,b) '^i,{a,b)) ^ ^ ^ ^ ^id^,b)iTi,{a,b) ^i,{a,b))' (1^) 

iee {a,b)£S^ i^e {a,b)£S^ 

For i € £, it holds that 

{ — In I if (a, b) = (0,0) 

-ln(l-p) if(a, &)=2/i 
— In I otherwise. 

Therefore the first term in (fTOl l can be expanded as 

/ 

-In I (<(0,0) - r°(o.o)) -ln(l ^ 

\ (o,&)GE=\((0.0)uyi) 

/ 


E 

iGS 


ln^( 


2 ^ ‘ i,{a,b) i,{a,b)) 


= E 

i^S 


-ln3(r*(o.o) -l)-ln(l-F)r*« - 


V 


E 

(a,h)GE2\((0,0)Uy, 


ln|<(a.6) 


= E - E In !<(„_,) -ln(l-p)r* 


i^£ 




\ (a,b)&Y.-^\y i 

^ (- In |(1 - rVp - ln(l - p)t‘« J + |^| In | 


in In I 


i^£ 


3(l-p)^y 
For i d £'\t hods that 


( 11 ) 


A. 


2 ,(a,6) 


— ln(l — p) if (a, b) = (0, 0) 

— In I otherwise. 


( 12 ) 
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Therefore the second term in (fTOl t can be expanded 




i^S 


(a,6)eS2\(o,0) 


= X! “ “ p^Kiofi) -1) - 


i^S 


{a,6)eS2\(0,0) 


P. 

^ ‘ z,(a,b) 


= II In(l-P) I] <(a,b)-lnf ^ 


i^S 

= ^ln 

i^S 


(a,b)eE2\(Q_Q) 


'i,(a,b) 


(a,b)GE2\(0,0) 


3(1-p) 


E 


'i,{a,b)' 


(a,6)eS2\(0,0) 


Multiplying dTOjl by (^In 3 ( 1 ^) , Q 


E “ E E <(a,b) < i-^i - E E 

' i^e (a,&)Gi:2\(0,0) 2^5 (a,6)eS2\(0,0) 


n — 1 


si^i-E E 

i=0 (a,b)gE2\(o.O) 

< 2 |f I - d% 

< 0 , 


i,{a,b) 


E E ■ 

2^5 (o,6)eE2\(o,0) 


2 ,(a,6) 


(13) 


(14) 


where the first line follows form t*„ < 1 and the third line follows from J2(a &)ge^\(o o) 


(a,6)GE2\(0,0) 


= max{a;-,a;-+i} 

= Wfp{{Tr{x*))i) 


>d%- 


(15) 


The inequality (fT4l) suggests that (fTUl) > 0 and it contradicts (|9l). Therefore x* = 0. 


C. Computing the Lower Bound of the Fractional Pair Distance 

In ©, the efficient way to compute the fractional distance of an LDPC code is presented. We can compute the 
lower bound of the fractional pair distance of an LDPC code in a similar way. The reason that only lower bound can 
be computed is explained later. To compute the fractional pair distance, we have to compute min 3 ,gyg\{o} 

Note that W7p(a:) is a convex function of x since it can be expressed as the maximum of linear functions. Let IF 
be the set of facets in Q{H) which 0 doesn’t sit on. We choose a facet in F and minimize Wfpix) over that facet. 
The obtained minimum is a lower bound of the values of Wfp{x) of the vertices in the facet. The minimum value 
obtained over all facets in F is the lower bound of Wfp(x) over all vertices other than 0. 

'note that ^In < 0 when p < 0.75 
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Fig. 1. The average fractional distance df and the lower bound of the fractional pair distance dfp as functions of length for a randomly 
generated LDPC code, with variable degree 3, check degree 4, from an ensemble of Gallager do). 


Fig. m shows the average fractional pair distance of a randomly generated LDPC code. The generated LDPC code 
has variable degree 3, check degree 4, and is randomly generated from an ensemble of Gallager IfTOll . It seems that 
the lower bound of the fractional pair distance grows linearly with the block length. The fractional pair distance is 
about 1.8 times as large as the fractional distance. Since the fractional pair distance is a lower bound of the pair 
distance, the figure also gives a lower bound of the pair distance of the LDPC code. 

D. Numerical Experiments 

Here, we see the performance of the proposed LP decoder through numerical experiments. We used a randomly 
generated rate-1/4 LDPC code with variable degree 3 and check degree 4. As one benefit of formulating the decoding 
problem as an LP problem, we can obtain ML decoder by solving the corresponding integer linear program. Fig. 
|2] shows an error-rate comparison for a block length of 60. There exists a gap between the performance of LP 
decoder and that of ML decoder. We hope that this gap can be filled by tightening the relaxation as in the case for 
memory less channels ||6]. 
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pair error probability p 


Fig. 2. A comparison between the performance of LP decoding and ML decoding using the same random rate-1/4 LDPC code with length 60. 


V. Conclusion 

We have developed an LP based decoding algorithm of binary linear codes for symbol-pair read channel. We also 
proved some results on its error-correcting performance. We introduced a notion called the fractional pair distance 
of a code, which characterize the lower bound of the number of correctable pair errors that the proposed algorithm 
can correct. 

There are some future directions of the work presented here. We assumed that the pair error occurs at each 
position independently. However, this assumption may not hold for some cases. The LP decoding for finite-state 
channels is proposed in ifTSll and we can consider the symbol-pair read channel with finite state. To develop the LP 
decoding algorithm for such channel is a future work. 

In this paper we assumed that the code is binary. The LP decoding for nonbinary linear codes for memoryless 
channel was presented in flTl . It would be interesting to develop the LP decoding algorithm of nonbinary linear 
codes for symbol-pair read channel. 
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Appendix A 
Proof of theorem|2] 

In this appendix, we prove theorem|2] Recall that Pr[err|a;] is the probability that the LP decoder fails, given that 
X was transmitted. It suffices to show that 


Pr[err|a;] = Pr[err|0], 

for an arbitrary transmitted codeword x € C. Dehne B{x) as follows. 

(16) 

B{x) = |2/G (E^)” 1 3{x',t') s.t. {x',t') is a feasible solution of (|4]i, x ^ x', {\,t' 

) < (A,r(ai))}. (17) 



Note that A is a function of the received vector y. B{x) is the set of received vectors 

failure when x was transmitted. We can rewrite (|5]l as follows. 

y that cause decoding 

Pr[err a;] = ^ p{y \x). 

'v^B{x) 

Applying the above to the codeword 0, we obtain 

(18) 

Pr[err|0]= ^ p(y |0). 

y&B{p) 

(19) 

In a similar manner to the proof in Q, we will show that the space (E^)" of received vectors can be partitioned 

into pairs (?/, Vq) such that 

.•f4- . _ _ 

Pr[ 2 / 1®] =Pr[2/o |0], 

( 20 ) 

2/G B{x) 2/oe B{0). 

( 21 ) 


These, along with (fTSl l and (fT9] l. give 


Pr[err|a:] = Pr[err|0]. 

We partition the space of received vectors as follows. For an arbitrary received vector y, dehne Vo as follows. Let 
{yo)^ be {yo)i = {y)i ©p ( 7 r(a;))i, where ®p denotes the pairwise exclusive OR, that is, (ai, 6 i) ©p ( 02 , 62 ) = 
(oi © 61,02 © 62 ). Then (2/0)1 = (0,0) if and only if (2/)i = and therefore, 

Pr[y I®] =Pr[yo |0], (22) 

holds. Note that the operation to make Vq from y is its own inverse and it gives a valid partition of into 

pairs. 

Now it remains to show that 2/G B(x) if and only if 2/oG ^(0). Before proving it, we introduce some lemmas. 
Lemma 2: Let {x^^r-f) be a feasible solution of (|4]i. We dehne the relative solution (a;’',r’') of {x ^to 
a; G C as follows. 


r _ / 


= 0 , 


, - l,(a, 6 ) G 


(23) 


= '^*.( 1 , 0 ) 


Ai.i)’ 


i = 0,..., n — 1. 


(24) 
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Then the relative solution is a feasible solution of Q. 

Proof: e [0,1] for i = 0,..., n - 1, (a, b) G = 1 for i = 0,... ,n — 1 are 

obvious. From the definition of the relative solution, 

r r f f 



1 ) if (7r(a;))i = (0,0) or (1,0) 

<(0.0) + <(1.0) = 1 - (<(0.1) + <(1.1)) if = (O’ f) or (f’ f)> 


(25) 


^i+l -'^z+l.(1.0) ''i+l.(l.l) - ''i+l.(1.0)©p(7r(^))i+i ■^''z+l.(l,l)©p(77(a;))i+i 


+ T 


J 


J 


^i+l,il,0) + <(1.1) 

J 


if (7r(a;))i+i = (0, 0) or (0,1) 


<+i.(o.o) + <+i.(o.i) = f - «+i.(i.o) + <+i.(i.i)) if (’r(a^))i+i = (1,«) or (1,1). 

(26) 

Considering the fact that the second element of (7r(a;))i equals to the first element of (7r(a:))i+i, it always holds 

Ft.( 0 . 1 ) ■l"Fi.(i,i) ~ Fi+i.(i,o) Fi+i.(i,i) ~ ^i+i fiom Xi+i = <(o.i) +<(i.i) = '^i+i.(i,o) +<fi.(i,i)- 

we show that x'' G Q{H). From the definition, 


~ ^^.(l.o) + Fi,(l,l) 

<( 1 . 0 ) + <( 1 . 1 ) = < if (’r(a;))* = (0,0) or (0,1) 

J 


<( 0 , 0 ) + <( 0 . 1 ) = 1 - < if (’^(a:))* = (1> 0) or (1,1) 


= l< 


(27) 


It is proved in @ that a:’', whose elements are calculated from dZTl) . satisfies a;’' G Q{H). ■ 

For the relative solutions, following lemma holds. 

Lemma 3: For an arbitrary x G C, an arbitrary feasible solution {x^ ,t^), and an arbitrary received vector y, it 
holds that 


(A,T7-r) = (A0,r’'-r0), 

where A = A(y), t = T[x), A° = A(yo), = r(0), and r'’ is the relative solution of to x. 
Proof: From the definition of A°, it holds that 

n 

<(a.b) = -lnp(y* |(a,6)) 

= -lnp(yj ©p(7r(a;))i|(a, 6)) 

= -lnp(yj \{a,b) ©p (7r(a;))j) 

'^i,(a,6)©p(7r(a!;))i • 


Therefore, for * = 0, 


, n — 1, it holds that 


^ ^ '^z,(a,b)'^'i,(a,b) ^ ^ '^t,(a,b)©p('7r(a5))i't)^^Q^^^0^^7,.fa5^^^ 

(a,b)eE2 (a,h)Gi:2 

(a,6)Gi:2 


(28) 


(29) 


(30) 


( 31 ) 
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and (A,t^) = (AO,t’’). Further, from Eia,b)G^^ hia,b)Ti,ia,b) = E(a,6)eE= ^°(a.b)^°(a.b) = ^°(o.o) 

and A°(o o) = it also holds that (A,t) = (A°,t°). ■ 

We can prove the following lemma from Lemma |2] and Lemma [3 

Lemma 4: Fix a codeword x G C. For an arbitrary feasible solution {x-f , t^) of (|4|i, x^ ^ x, there exists another 
feasible solution (a;’’,r’’) such that x'" ^ 0 and 

(A,x/-T) = (AO,r’'-xO). (32) 

Further, for an arbitrary feasible solution of (|4]l, a;’’ ^ 0, there exists another feasible solution (x^, t^) 

such that x^ ^ X and 


{\°,T^ -T°) = {\Tf -t). 


(33) 


Proof: Let (a;’',T'’) be the relative solution of {x-^,t^) to x. From Lemma|2l (a;’',T’’) is a feasible solution 
of (|4]l. From the definition, x^ 0 and applying Lemma [3 we obtain 


{X,T^ -t) = (A^r'-rO). 


(34) 


The latter of the lemma can be proved by replacing the role of (x^, t^) and (a;’’, r’’). ■ 

We are now ready to prove that VG B[x) if and only if J/qG ^(0) it completes the proof of the Theorem 
121 Assume that UG B{x). From the definition of B(x), there exists a feasible solution {x^, t^) of (l4|i that satisfies 
x^ X and 


{\t^-t)<Q. (35) 

From Lemma m there exists a feasible solution (a:'',r’’) such that a;'" ^ 0 and 

(A°,r’'-x°) < 0. (36) 

It means VoG B{0). The other direction can be proved with the symmetric argument. 


Appendix B 
Proof of LemmaU 

It is known that a vertex of a polytope of dimension N is uniquely determined by N linearly independent facets 
of the polytope which the vertex sits on m. The dimension of the feasible region of (|4|i is 5n. Let {x,t) be an 
arbitrary vertex of the feasible region of (|4]i and hence it sits on 5n linearly independent facets in dUi. We will 
show that (a;, x) sits on at most An linearly independent facets in (|4]l which are not included in a; S Q{H). It leads 
that X sits on at least n linearly independent facets in a: S Q{H). Since x satisfies at most n linearly independent 
equality, x sits on just n linearly independent facets in a; G Q{H) and therefore x G Vq. 
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Here we show that an arbitrary vertex (x, t) of the feasible region of Q sits on at most An linearly independent 
facets in (|4]i other than the constraints in at G Q{H). Let Ei,E 2 , E 3 , E 4 be the sets which are defined as 


£i 

— {z G {1,.. 

. ,n} 

1 Xj and Xj+i are integral}, 

(37) 

£2 

= {i €{!,.. 

. ,rz} 

1 Xj is integral, Xj+i is not integral} , 

(38) 

£3 

= {z G {1, .. 

. ,n} 

1 Xj is not integral, Xj+i is integral}, 

(39) 

£4 

= {z G {1, .. 

. ,n} 

1 Xj and Xj+i are not integral} . 

(40) 


We show that for any i G Ej,j = 1,... ,4, and x^+i satisfy at most 4 linearly independent 

constraints in © other than the constraints in a; G Q{H). 

In the case i G Ei, = 1 and Ti^(^a,b) = ^ {xi,Xi+i). Other constraints are not linearly 

independent from these constraints and G {0,1}. Therefore just 4 linearly independent equality constraints 

Xi,{a,b) G {0,1} , (a, h) G Y? are satisfied. 

Next, we consider the case i G i?2- First we assume that Xi = 0. Then (i^o) = ^^,(1,1) = 0 holds from the 

constraint Xi = t, (1 g) +3"^ (1 1). In this case, it holds that 0 < (0,0)1 ri_(o,i) < 1 otherwise it leads contradiction to 

the assumption Xi+i is not integral (For example, if Xj (q q) = 0 then (04) = 1 and Xi+i = (04) +7^ (1 1) = 1). 

The equality Xi = Tj (1 0) + is linearly dependent to the equalities Tj (1 0) = 0 , Tj_(i 1) = 0 and Xi = 0 . 

Therefore just 4 linearly independent equality constraints (1 0) = 0 , Tj (^ 1) = 0 , X](o b)GE2 'L,(a,b) = Xi+i = 
Ti,(o,i)+Ti,(i,i) are satisfied. When Xi = 1 then Ti^(o,o) = Tj_(o,i) = 0 holds from the constraints Xj = 
and E(a, b)gE2 Ti,{a,b) = 1 - It be shown that just 4 linearly independent equality constraints are satisfied in a 
similar way for the case Xj = 0. 

For the case i G E^, we can prove that just 4 linearly independent equality constraints are satisfied in a similar 
way for the case i G £'2 by exchanging the role of Xj and Xj+i. 

Finally, we consider the case i G £4. From the assumption that 0 < Xj < 1 and constraint Xj = Tj (1 0) +Tj (1 1), 
it turned out that one of the three cases a) Tj (1 0) = 0,0 < Tj_(i 1) < 1 or b) Tj_(i 1) = 0, 0 < Tj (1 0) < 1 or c) 
0 < Tj_(i_o),Tj,(i,i) < 1 occurs. 

a) We consider the case that Tj (1 0) = 0, 0 < Tj (1 1) < 1 hold. It holds that Tj (0,0) + Ti,(o,i) = 1 — Xj from 
the constraints Xj = Tj_(i^o) + Ti,(i,i) and Yl,(a,b)€:^2 ’’’i.fa.b) = 1- ’’'i.fo.o) = 0 

or a- 2 ) Tj^(o,i) = 0 or a- 3 ) 0 < Tj,(o,o),Tj,(o.i) < 1 occurs. 
a- 1 ) If we assume Tj_(o_o) = 0 then Tj^(o,i) = 1 —Xj and Xj+i = Tj^(o,i)+Tj^(i^i) = 1. But it contradicts 
the assumption Xj is not integral and it never happens. 
a- 2 ) We assume Tj_(o,i) = 0. From the constraint Xj = Tj_(i_o) +Tj,(i4) and the assumption Tj (i^o) = 0, 
Tj (1 1) = Xj. Therefore, Xj+i = Tj (04) + Tj_(i 1) = 0 + Xj = Xj holds. Above argument shows 
that the constraint Xj+i = Tj (o_i) + Tj (i ^) is linearly dependent to the constraints Tj (^ q) = 0, 
Ti,(o,i) = 0, Xj = Tj (1 0) + Tj (1 1) and just 4 linearly independent constraints Tj_(i qj = 0, 
Ti,(o,i) = 0, E(o.&)ge 2 Tj,(a,b) = 1, a:j = Tj_(i^o) +'^*.( 1 , 1 ) are satisfied. 
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a-3) When 0 < Tj (o_o)) 7^,(0,!) < 1 holds, 4 linearly independent constraints r, (i q) = 0, 

X](o,&) 6 E 2 T'i,(a,b) = 1, + ''’*,(1.1)’ -^*+1 = ''’*.(0,1) + ''*,(1,1) Satisfied. 

b) We consider the case that (i.i) = 0, 0 < (i q) < 1 hold. As in the case a), it holds that (op) + 
A.( 0 , 1 ) = 1—aii and one of the three cases b-1) Tj_(op) = 0 orb-2) Tj_(o,i) = 0 orb-3) 0 < Ti_(op), Ti_(o,i) < 
1 occurs. 

b-1) Consider the case that (op) = 0 holds. Then Tp(o,i) = 1 — Xi and Xi+i = Tj (op) + ^,( 1 , 1 ) = 
I — Xi holds. Above argument shows that the constraint Xi+i = Tj (op) + (ip) is linearly 
dependent to the constraints Ti^(op) = 0, Ti^(ip) = 0 and Xi = Tp(ip) -I- Tp(ip). Therefore 
just 4 linearly independent constraints 'rp(op) = 0, (ip) = 0, X](a f))eE 2 ''’*,( 0 , 6 ) = 1 and 
= ''■*.( 1 , 0 ) + ''■*.( 1 , 1 ) are satisfied. 

b-2) If we assume that Tp(op) = 0 holds then Xj+i = Tp(op) + ''i,)!,!) = 0, but it contradict the 
assumption Xi+i is not integral. Therefore this case never occurs. 

b-3) When 0 < Tj (op),''■i,(o.i) < 1 holds, 4 linearly independent constraints Tj (^p) = 0, 

X](o,b)GE 2 ■'’*.( 0 , 6 ) = 1 , aij = Tp(ip) + fp)! !), Xj+i = Tp(op) "f fp)! !) are satisfied. 

c) Assume that 0 < Tp(ip), rp(ip) < 1 holds. As in the other cases, it holds that Tp(op) -f ''■p(op) = 1 — Xi 
and one of the three cases c-1) Tp(op) = 0 or c-2) Tp(op) = 0 or c-3) 0 < Tp(op), r^po,!) < 1 occurs. 

c-1) When Tp(op) = 0 holds then Tp(op) = 1 — Xi and therefore 0 < Tp(op) < 1. In this case, 4 
linearly independent constraints Tp(op) = 0, I](a,b)eE 2 ''■i.(a.b) = 1, Xi = Tp(ip) + Tp(ip) and 
Xi+i = rp(op) + rp(ip) are satisfied. 

c-2) When Tp(op) = 0 holds then Tp(op) = 1 — Xi and 0 < Tp(op) < 1 hold. Therefore 4 linearly 
independent constraints Tp(op) = 0, I](a,b)eE 2 ''■i.(a.b) = 1, x* = ri_(ip) -f npip) and Xi+i = 
+''■^.( 1 , 1 ) are satisfied. 

c-3) Assume that 0 < 'rp(op) >''■i,(o,i) < 1 holds. In this case, 3 linearly independent constraints 

I](a.&)GS 2 Ti,(a,b) = 1, Xi = Tp(ip) -f Tp(ip) and Xi+i = Tp(op) -f Apip) are satisfied. 
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